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Expected  value  functions  as  functions  of  decisions  and  decision  strategies  are 
flat  around  their  maxima.  This  so  called  flat  maximum  phenomenon  has  been  discovered 
.n  sensitivity  analyses  in  virtually  all  decision  theoretic  paradigms.  But  until  now 
most  of  the  research  on  flat  maxima  explored  more  or  less  general  examples  and 
limiting  considerations.  Two  basic  questions  remained  unanswered:  what  are  the 
mathematical  reasons  for  the  restricted  shape  of  the.  evaluation  functions;  and  can 
these  restrictions  be  interpreted  as  flatness  in  a  psychological  sense?  While  the 
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second  qestion  calls  for  psychological  experimentation,  the  first  question  can 

be  answered  with  mathematical  tools.  The  present  article  shows  that  the  mathematical 

characteristics  of  linear  optimization  models  impose  severe  restrictions  on  the 

functions  evaluating  choice  alternatives  such  as  gambles,  multi-attributed  outcomes 

or  consumption  streams.  The  course  of  proof  of  this  argument  provides  a  helpful  * 

tool  for  sensitivity  analyses  m  decision  theory.  The  concepts  and  methods  are 

demonstrated  in  examples  from  statistical  decision  theory,  psychological  modeling 
and  applied  decision  theory.  P  ywiuj.ogicaj.  modeling. 
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Introduction 


Decision  theoretic  models,  such  as  expectation  models,  multi-attribute 
utility  models.,  or  time-discounting  models,  evaluate  decisions  or  decision  stra¬ 
tegies  with  a  real  number'd  index  of  preference  such  as  a  utility,  a  present 
value,  risk,  expected  value,  etc.  Most  prominent  decision  theoretic  models 
are  linear  decomposition  models  that  base  their  evaluation  of  choice  alternatives 
(a  generic  term  for  gambles,  multi-attributed  outcomes,  and  consumption  streams) 
on  a  weighted  additive  integration  of  subjective  or  objective  input  parameters, 
which  the  decision  maker  or  experts  provide  by  means  of  simple  choices  or  judg¬ 
mental  tasks.  For  example,  the  subjective  expected  utility  model  evaluates 
gambles  by  combining  subjective  probabilities  of  events  and  utilities  of  decision 
outcomes  into  expected  utilities. 

One  of  the  peculiarities  of  linear  optimization  models  is  the  flatness  of 
their  evaluation  function  in  the  area  of  optimal  choice  alternatives.  (We 
implicitly  assume  uere  and  in  the  following  that  chc  i.ce  alternatives  have  a 
continuous  or  dense  numerical  description  as  vectors,  decision  functions,  stop¬ 
ping  rules,  probability  cutofis,  etc.)  A  suboptimal  choice  does  not  seriously 
hurt  the  decision  maker  as  long  as  the  alternative  selected  is  not  grosslv  away 
from  the  optimum.  This  type  of  insensitivity  is  closely  linked  to  a  second  type, 
which  is  often  found  in  decision  analysis  settings.  Variations  of  model  para¬ 
meters  like  importance  weights  or  subjective  probabilities  seldom  produce 
drastic  changes  in  the  model  evaluation  function.  A  set  of  quite  different  para¬ 
meter  values  may  lead  to  the  selection  of  the  same  choice  alternative 5  and  even 
if  the  use  of  a  wrong  set,  of  parameter  values  leads  to  a  different  decision,  the 
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first  type  of  insensitivity  will  guarantee  that  the  loss  in  expected  value  as 
calculated  by  means  of  the  model  with  the  correct  parameters  will  be  rather 
small.  Some  researchers  ( Interna  and  Torgerson,  1961)  cave  even  argued  for  an 
insensitivity  across  models.  According  to  their  results  different  models 
should — under  some  mild  conditions — lead  to  similar  evaluations  and  decisions. 

Although  there  are  doubts  about  insensitivity  across  models  (see  Fischer, 
1972)  the  evidence  for  the  two  other  kinds  of  insensitivity  is  substantial. 

In  expectation  models  v.  Winter feldt  and  Edwards  (1975)  generalized  scattered 
findings  of  flat  expected  value  functions  as  functions  of  decisions  and  decision 
strategies.  In  multi-attribute  utility  theory  Fischer  (1972)  demonstrated  the 
insensitivity  of  multi-attribute  utility  functions  against  variations  in  param¬ 
eters  like  importance  weights  and  single  dimension  utilities. 

But  up  until  now  the  evidence  for  flat  maxima  was  based  on  more  or  less 
general  examples.  The  questions  remained  whether  or  not  flatness  is  a  ne¬ 
cessity  and  what  model  characteristics  cause  it.  Another  problem  with  the 
arguments  for  insensiti/ity  in  those  examples  is  the  concept  of  flatness  itself. 

A  function  may  look  flat,  but  that  can  easily  be  fixed  by  stretching  the  units 
of  the  ordinate  and  compressing  the  units  of  the  abscissa.  Flatness  is  not  a 
mathematical,  but  a  psychological  concept.  %  loss  may  be  substantial  for  one 
decision  maker  and  negligible  for  another. 

These  arguments  call  for  two  kinds  of  research  on  the  flax  maximum  phenom¬ 
enon:  first,  a  mathematical  analysis  that  proves  the  inevitability  of  restricted 
forms  of  the  evaluation  functions,  given  certain  model  characteristics,  and 
second,  an  experimental  psychological  analysis  that  shows  whether  or  not  these 
restrletiuns  can  be  int  "preted  as  flatness. 
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This  report  presents  the  mathematical  foundation  of  the  flat  maximum  phe¬ 
nomenon.  Integrating  some  theorems  from  statistical  decision  theory  it  shows 
that  the  nature  of  all  linear  optimization  models  imposes  severe  restrictions 
on  the  model  evaluation  function.  The  mathematical  proofs  produce  two  further 
important  and  practical  results:  they  establish  an  equivalence  between  model 
insensitivity  against  variations  in  choice  alternatives  and  against  variations 
in  parameter  values ;  and  they  present  the  tools  for  a  general  and  simple  ap¬ 
proach  to  sensitivity  analyses.  Some  examples  from  statistics,  psychological 
modeling,  and  decision  analysis  demonstrate  the  use  of  the  concepts  and  methods 
developed. 

Why  Evaluation  Functions  Are  Restricted 

The  most  severe  restriction  on  a  function  is,  of  course,  the  specification 
of  its  functional  form  and  parameters,  which  determines  each  point  of  its  graph. 
At  the  other  extreme  one  may  only  know  that  f  is  a  function.  Between  these 
extremes  there  are  more  or  less  severe  confining  properties  such  as  convexity, 
continuity,  boundedness,  number  of  minima  and  maxima,  etc.  Assume,  for  example, 
that  all  we  know  about  the  function  y  =  f(x)  is 

(a)  it  is  defined  for  0  §  x  £  1  and  bounded  between  y  =  0  and  y  =  1; 

(b)  it  is  strictly  convex; 

(c)  it  is  continuous; 

(d)  it  has  a  unique  ninimum. 

Figure  la  gives  some  examples  of  graphs  of  functions  which  satisfy  (a)  -  (d). 
Figure  lb  illustrates  some  inadmissible  cases. 


Insert  Figures  la  and  lb  about  here 


This  section  will  present  the  mathematical  proof  that  evaluation  functions 
in  linear  optimization  models  have  confining  properties  like  the  ones  discussed 
in  this  example.  The  substance  of  our  argument  are  three  theorems  from  satis- 
tical  decision  theory,  which  are  proven  in  Ferguson  (1967)  and  PeGroot  (1^70). 
The  arguments  and  proofs  are  quite  technical,  but  all  theorems  have  a  simple 
intuitive  meaning,  and  except  for  theorem  3  they  seem  self  evident.  Rather 
than  boring  the  reader  with  messy  mathematics,  we  will  rely  on  self-evidence, 
whenever  possible,  and  confine  ourselves  to  interpretation.  The  reader  inter¬ 
ested  in  more  mathematical  detail  should  consult  the  two  references  cited.  For 
ilia*  ration  a  scoring  rule  example  will  accompany  all  theorems  and  proof 
arguments. 

We  want  to  study  the  behavior  of  the  model  evaluation  function  U  which  is 

defined  over  a  set  of  choice  alternatives  X  =  [x,£,z, . ].  For  example,  X 

may  be  a  set  of  gambles,  decision  functions,  or  multi -attributed  outcomes;  U 
may  be  a  utility  function  or  an  expected  utility  function.  In  our  scoring  rule 
example  X  will  be  a  set  of  probability  estimates  which  are  gambles  by  the  defi¬ 
nition  of  a  scoring  rule;  U  is  the  subjective  expected  value  (SEV)  of  those 
gambles . 

The  application  of  a  linear  decomposition  model  to  such  a  choice  situation 
requires  each  x  to  be  described  as  an  n-tuple  of  elements  characterizing  x 
for  a  specific  aspect  or  state  S  of  the  choice  situation.  We  assume  therefore 
that  x  has  the  following  representation: 


1| 


"l  S2 

*  =  X2> 


n 


,Xn} 


For  example,  x  may  be  a  gamble  in  which  one  receives  a  dollar  amount  ^  it 
event  S.  occurs;  or  a  multi-attributed  outcomes  with  value  in  attribute  S^, 
or  a  cash  flow  in  which  one  receives  a  dollar  amount  at  time  S^.  Note  that 
by  labelling  we  implicitly  let  the  number  of  states  be  finite.  This  finiteness 
of  the  state  space  will  be  our  first  assumption  (Al)  for  the  further  mathematical 
development . 

Linear  decomposition  models  go  further  by  defining  utility  functions  u^ 

within  each  state  S  so  that  each  choice  alternative  can  now  be  characterized  by 

i 

a  vector  of  single  state  utilities: 


u(x)  =  (u1(x1),  u2(x2), . ,ui(xi), . ,un(xn)) 

According  to  our  second  assumption  (A2)  these  utilities  are  bounded,  i.e., 
m  £  u.(x  )  £  M  for  all  i  and  x  €  X,  and  some  real  m,M. 

Furthermore,  ,in  linear  optimization  models  a  weight  vector  w  from  a  param¬ 
eter  set  W  associates  with  each  state  a  weight  w^  which  can  be  interpreted 
as  a  subjective  probability,  an  importance  weight,  or  a  discounting  rate.  In 
expectation,  time  discounting,  and  multi -attribute  models  we  can  assume  that 

w  ^  0  and  £  w  =  1. 

1  =1  1 

The  linear  model  evaluate  ,  choice  alternatives  x  now  by  computing  the 
scalar  product  w  o  u  of  the  vectors  u  and  w,  or  more  simply  as  the  weighted 
average: 
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u(x,w) 


(l) 


n 

=  wou(x)  =  £  w  u  (x  ) 
i=l  "  1  i 

That  alternative  x*  is  optimal,  which  maximizes1  U,  i.e.,  the  decision  rule  of 
linear  optimization  models  is: 

"choose  x*  with  U(x*,w)  >  U(x,w)  for  all  x  c  X" 

We  will  define 

U*(w)  =  U(x*,w)  (2) 

i.e.,  U*  is  the  maximal  attainable  utility  for  a  specific  weight  vector  w.  In 
statistical  decision  theory  x*  would  be  called  a  Bayes  decision  with  respect  to 
the  prior  distribution  w. 

Let  us  interpret  the  previous  paragraph  in  the  scoring  rule  situation, 

assuming  a  simple  two  state  case,  in  which  and  S2  are  two  mutually  exclusive 

and  exhaustive  events,  w^  and  w^  are  the  associated  true  subjective  probabilities 

2 

(SP's).  The  set  of  choice  alternatives  X  is  a  subset  of  the  real  plane  R  , 

• 

namely  the  tuples  (x^x^  with  0  <  xi  <  1  and  x^  +  x2  =  1.  The  x^'s  are 
interpreted  as  the  stated  probabilities  of  the  events  S^.  Since  x  =  l-x^  the 
choice  set  can  be  totally  characterized  by  the  real  numbers  between  0  and  1. 
Scoring  functions  u^  and  ug  are  defined  for  each  state  S  such  that  U(x,w)  = 

SEV( x,w)  is  maximal  if  x  -  w.  We  will  specifically  analyze  the  quadratic  scoring 
rule  in  which: 


^■Our  whole  argument  will  be  based  on  maximazation.  The  dual  argument  based 
on  minimization  is  basically  the  same. 


ul(xl> 


u?(x,,)  =  1-x/ 


Schematically  the  scoring;  rule  paradigm  is  represented  in  Table  1. 


Insert  Table  1  about  here 


Here,  of  course,  U*(w)  has  a  very  clear  interpretation:  U*(w)  =  U(x*,w)  = 

U(w,w) . 

Before  we  enter  into  a  discussion  of  the  behavior  of  U  and  U*,  we  need  to 
state  two  preliminary  theorems,  which  will  establish  a  relation  between  the 
parameter  set  W  and  the  choice  set  X. 

THM  1  Assuming  that  the  state  space  if  finite  (Al)  and  that  the 
Ui(Xi)  are  bounded  (A2) ,  there  exists  for  every  w  e  W  at 
least  one  x  e  X  such  thao  U(x,w)  =  U*(w). 

We  will  define  the  subset  of  X,  which  contains  those  elements  x  which  are  otpimal 
under  w  as  X  ,  and  elements  of  X  as  x  .  A  similar  theorem  is  proven  in 
Ferguson  (1967).  It  seems  self  evident  for  finite  X:  you  just  order  the  x’s 
according  to  their  U- values  (all  of  which  are  finite  by  Al  and  A2)  and  choose 
the  x  with  the  maximal  U. 

The  second  theorem  is  more  sophisticated  and,  in  fact,  substantial  work  is 
based  on  it  in  decision  theory.  To  state  it,  we  first  have  to  introduce  the 
notion  of  dominance  (here  in  a  somewhat  wider  sense  than  usual).  We  call  a 
choice  alternative  x  dominated,  if  there  exist  other  alternatives  y  and  z  and 
a  real  number  a  (0  <  a  <  l)  such  that 
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and 


(1)  ui(xi)  <  au^yj  +  (l-a)u  (z  )  for  all  i  (4a) 

(2)  u^x^)  <  aui(y1)  +  (l-a)u  (z  )  for  some  i.  (4b) 

The  set  of  non-dominated  alternatives  is  called  admissible.  We  label  the 
admissible  subset  of  X  as  X,  and  we  will  assume  in  the  following  that  X  =  X 
(A?). 

THM  2  Given  Al,  A2,  and  A3,  there  exists  for  all  x  e  X  at  least 
one  w  e  W  such  that  U(x,w)  =  U*(w). 

Similarly  to  theorem  1  we  will  define  the  subset  of  W  which  contains  those 

parameters  w  which  would  make  x  an  optimal  choice  W  ,  and  elements  w  e  W  we 
~  ~  x  -  x 

will  call  w  .  This  theorem  is  rather  difficult  to  prove  and  requires  a  sub- 

“X 

stantial  number  of  "lemmas"  such  as  the  famous  separating  hyperplane  theorem. 

The  idea  of  theorem  2,  however,  is  simple:  admissible  choice  alternatives  are 
potential  candidates  for  optimal  choices. 

Theorems  1  and  2  allow  us  to  step  freely  from  the  parameter  set  W  to  the 
choice  set  X  and  back  in  our  analysis  of  U  as  a  function  of  both,  w  and  x. 

The  main  purpose  of  these  theorems  here  is  to  establish  an  equivalence  between 
parameters  and  choice  alternatives  for  the  insensitivity  analysis. 

Both  theorems  have  a  simple  interpretation  in  our  scoring  rule  example. 
Since  here  X  =  W  and,  by  definition  of  a  proper  scoring  rule  X  =  X,  the  theorems 
say  that  for  each  true  subjective  probability  vector  w,  there  is  an  optimal 
probability  estimate  x,  and  for  each  estimate  x  there  is  a  subjective  prob¬ 
ability  vector  w  which  'rould  make  this  estimate  optimal.  In  fact,  we  already 
knew  that,  since  the  unique  value  x  =  w  was  the  best  estimate  in  the  SEV  sense. 

f\ 


■■■■■ 


After  these  preliminary  theorems  we  are  now  able  to  study  the  restrictions 
on  U  and  U*.  By  the  definition  of  U*  and  by  the  properties  of  linear  opti¬ 
mization  models,  we  know  that 


(1)  the  range  of  U*  will  be  the  restricted  range  of  W, 

(2)  U*  has  to  go  through  all  the  corner  pc  c  [sup  =  1] 

X 

But  our  third  theorem  imposes  a  much  more  severe  restriction  on  U*: 

THM  3  Under  A1  and  A2  U*  as  defined  in  (3)  is  a  convex  function 
of  w,  i.e.,  U*[av  +  (l-a)w]  <  aU*(v)  +  (l-a)U*(w)  for  all 
0  <  a  <  1,  v,w  e  W, 

The  proof  is  rather  simple,  and  it  is  presented  here,  since  the  convexity  of  U* 
is  not  at  all  self  evident.  For  a  different  version  of  the  proof,  see  DeGroot 
(1970).  Consider  the  vector  av  +  (l-a)w.  From  theorem  1  we  know  that  there  is 
at  least  one  x  such  tha+ 


U[x, av  +  (l-a)w]  =  U*[av  +  (l-a)w]. 


By  definition  of  U 


U[x,av  +  (l-a)w]  =  [av  +  (l-a)w]  ou(x)  =  avou(x)  +  (l-a)  wouCx,.  (6) 


The  latter  equality  follows  from  the  distributivity  of  "o."  Again  by  theorem  1 
there  exist  y  and  z  (  X  such  that 


U(jr,v)  =  U*(  v) 
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U(z,w)  =  U*(w) . 


(8) 


Since 


U*(v)  >  U(x,v)  =  vou(x)  (C)) 

and 

U*(w)  >  U(x,w)  =  wou(x)  (10) 

it  follows  by  substitution  that 

U*[av  +  (l-a)w]  =  aU(x,v)+  (l-a)U(x,w)  <  aU*(v)  +  (l-a)U*(w).  (11) 

What  does  this  theorem  mean  in  our  scoring  rule  example?  Defining  U*(w) 
as  U*(w^),  we  see  that  U*  is  sever ly  restricted  through  the  boundaries  and  by 
convexity.  Figure  2a  gives  some  examples  of  graphs  of  U*  functions  which  might 

have  been  generated  by  some  scoring  rule  (actually,  U*  is  equivalent  to  some 

Insert  Figures  2a,  2b,  and  2c  about  here 

scoring  rule).  Figure  2b  shows  inadmissible  graphs.  Figure  2c  shows  the  U* 
function  for  our  quadratic  scoring  rule.  The  interpretation  of  convexity  in 
this  example  is  very  intuitive:  the  more  certain  you  are  about  the  events  S^, 
the  better  your  optimal  decision  will  be  in  terms  of  SEV. 

We  know  now  that  U*  is  a  restricted  function  of  w,  but  what  about  U  as  a 
function  of  x?  With  theorems  1  and  2  it  becomes  simple  to  step  from  U*  to  U. 

U  has  two  arguments,  w  and  x.  We  know  that  U  is  linear  in  the  w^'s,  thus  as  a 
function  of  w  U  defines  an  n-1  dimensional  hyperplane.  In  the  scoring  rule 
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case  U  is  a  line  as  illustrated  in  Figure  3. 

Insert  Figure  3  about  here 

What  do  these  lines,  planes,  and  hyperplanes  ha^e  to  do  with  U*?  First, 

U  is  defined  on  the  same  space  W  on  which  U*  is  defined.  Second,  U(x,w)  and 
u*(w)  have  at  least  one  point  in  common,  namely  the  point  [U*(w  )  w  ].  Third 
u*  is  everywhere  at  least  as  large  as  U,  i.c.,  U*(u)  >  ll(x,w)  for  all  w  e  W 
and  x  e  X.  This  last  fact  follows  by  sinple  contradiction.  If  U*  was  not  at 
least  as  large  as  U  for  all  w  and  xf  then  there  would  exisc  some  x  and  w  such 
that  U(x,w)  >  U*(w),  which  contradicts  the  definition  of  U*(w) .  Therefore, 
as  a  line  U  is  a  tangent  to  U*,  as  a  plane  it  is  a  tangent  plane,  and  as  a 

hyperplane  it  is  a  tangent  hyperplane  to  U* .  Figure  b  clarifies  these  concepts 
in  our  scoring  rule  example. 

Figure  4  also  exemplifies  how  the  restrictions  on  U  and  the  possible  losses 
AU  are  determined  totally  by  the  shape  and  the  slopes  of  U*.  All  losses  which 

Insert  Figure  k  about  here 

may  be  encountered  in  a  choice  situation  (whether  they  are  due  to  a  suboptimal 
choice  or  the  use  of  a  wrong  set  of  parameter  values)  are  differences  between 
U*  and  some  hyperplane  tangent  to  it.  Assume  in  a  two  state  case  you  could 
construct  U*  without  restrictions  and  you  wanted  to  make  losses  around  a  value 
z  as  large  as  possible  within  the  boundaries  of  u  .  You  probably  would  con¬ 
struct  a  U*  function  which  looks  somewhat  like  in  Figure  5.  But  by  convexity  of 


Insert  Figure  5  about  here 
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U*  this  shape  is  inadmissible.  The  convexity  of  U*  will  make  losses  in  the  area 
of  the  optimal  choice  alternative  small. 

This  intuitive  interpretation  of  the  restrictions  on  the  behavior  of  U 
around  its  maximum  through  the  convexity  of  U*  can  also  be  expressed  mathema¬ 
tically.  Since  U  is  a  tangent  hyperplane  to  U*,  it  can  be  totally  determined 

by  n-1  slopes  and  one  point.  Assuming  that  U*  is  differentiable  at  w  ,  the 

—x 

actual  formula  for  U  in  terms  of  U*  is  consequently: 


U(x,w)  =  U*(w  )  +n£1  d  (w  -  w  ) 
— x  i=i  i  i  ix 


(12, 


where  d  -  du*/dw  |  w  ,  i.e.,  the  directional  slope  of  U*  evaluated  at  w 

i  l  ~x  — x* 

How  much  do  we  stand  to  lose  by  the  choice  of  an  nonoptimal  alternative? 
Assume  that  w^  is  the  true  weight  vector,  £  is  the  optimal  choice  alternative, 
but  instead  we  choose  x  /  ^r.  Since  by  definition 

u(2C,Ey)  =  u^(Wy)  (13) 

and 


u(^) 


U*(w  )  +nf^ 

-x  1=1 


dl(V 


W.  )  . 

IX 


(1U) 


we  will  lose 


AU  =  B.^)  -  U*^) 


(15) 


The  convexity  of  U*  puts  limits  on  the  differences  between  the  U*'s  as  well  as 

on  the  slopes  d  .  Since,  in  addition  (wj  -  w,  )  cannot  exceed  1  (and  will 
i  iy  ix  v 

typically  be  much  smaller)  the  loss  AU  will  remain  small. 
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/ 

How  wieh  will  we  lose  if  we  base  our  decision  on  a  parameter  value  v 
when,  in  fact,  the  true  value  is  w?  We  would  choose  x  such  that 


U(x,v)  =  U*(  v) 


We  will  receive 


(16) 


U(x,w)  =  U*(v)  +nf1  d.(w  -  v  ) 

~  i=l  1  1  i 

and  consequently  we  will  lose 


(17) 


AU 


U*(w)  -  U*( v)  d  (w  - 

“  i=l  i  i 


(18) 


Two  general  expressions  may  be  helpful  for  limiting  purposes:  the  maximum 
possible  loss  is  determined  by 


AU  =  max  {U*(e,  )-U*(f  )-d  +  d  ) 

max  k  ,1  “k  -r  k  l 


(19) 


where  e  and  f  are  the  unit  weight  vectors  with  e  =  1  and  f  =1.  See 

K  1  k  i 

Figure  6  for  illustration. 


Insert  Figure  6  about  here 


But  this  loss  would  only  result  from  an  extremely  foolish  choice.  By 
choosing  such  that  the  value  U(y_,w^)  is  the  minimal  point  of  U*  (in  decision 

theoretic  terms  y  ?s  the  minimax  strategy),  we  can  reduce  the  maximum  possible 
loss  to 

‘Vnirnax  '  T  (u1<xi))  "  (20) 

See  Figure  7  for  illustration. 


1? 


Insert  Figure  7  about  here 

By  now  it  should  be  clear  how  to  do  a  sensitivity  analysis  with  the  tools 
developed.  The  first  step  is  to  construct  the  function  U*.  Often  this  can  be 
done  explicitly.  If  an  explicit  solution  is  not  possible  or  too  difficult  and 
time  consuming,  one  can  help  oneself  with  the  following  procedure:  first,  plot 
the  cornerpoints  U*(e  ),  then  determine  U*(w)  where  w  is  the  "least  favorab?.e" 

K. 

weight  vector  which  would  make  a  minimax  choice  optimal.  Then  find  some  other 
points  of  U*  and  exploit  the  convexity  property  to  approximate  the  whole 
function.  Alternatively  U*  can  be  approximated  by  plotting  some  U  -  lines. 

This  procedure  can  be  done  graphically  in  two  state  cases.  In  cases  with  a 
larger  number  of  states  computer  aid  is  needed.  Equations  (19)  and  (20)  give 
some  boundary  losses,  and  equation  (it;  ^termine  for  each  particular  case  the 
potential  losses.  In  general:  the  flatter  U*  as  a  function  of  w,  the  flatter 
U  as  a  function  of  x  will  be  around  its  maximum. 

To  summarize  this  section:  First  we  established  a  relation  between  the 
parameter  set  and  the  choice  set  in  two  therorems  by  making  three  assumptions. 
We  assumed  that  the  state  space  is  finite  (Al),  that  the  single  state  utility 
functions  are  bounded  (A2),  and  that  the  choice  set  is  admissible  (A3).  Then 
we  showed  that  under  Al  and  A2  in  linear  optimization  models  the  function  U* 
is  severely  restricted  by  its  boundaries  and  through  convexity.  Finally,  we 
demonstrated  the  restrictions  on  the  actual  evaluation  function  U  as  a  function 
of  U*  and  outlined  a  general  approach  to  sensitivity  analysis  using  the  prop¬ 


erties  of  U*. 


The  next  section  will  give  some  examples  to  demonstrate  the  ^concepts  and 
methods  developed. 


Examples 

A  Signal  Detection  Example 

We  assume  a  simple  two  state  signal  detection  situation  in  which  a  datum  d 

may  be  sampled  from  either  of  two  normally  distributed  populations  S  or  S2. 

These  distributions  have  equal  variances  s  =  1  and  different  means  m  and  m  . 

1  2 

Two  decisions  can  be  made  upon  observing  d: 

(1)  a.  :  d  was  sa.rT>led  from  S  , 

j.  -  1' 

or 

(2)  a2:  d  was  sampled  from  S2< 

The  prior  probability  for  sampling  from  is  w^,  with  w1  =  1  -  w2«  Payoffs 
are  1^  for  correct  decisions,  0  for  incorrect  decision?. 

The  choice  set  X  here  is  the  set  of  real  valued  decision  functions  x, 
which  are  cutoffs  along  the  possible  real  values  of  d  (x  is  in  this  case 
related  to  the  usual  likelihood  ratio  criterion  P  by  x  =  lnP/d').  x  is  eval¬ 
uated  by  a  simple  expected  value  model. 

To  formulate  the  problem  in  the  format  of  he  preceding  section,  we  con¬ 
struct  t  le  expected  value  matrix,  where  expectations  are  taken  over  the  random 
variable  d  within  each  state  S  .  This  matrix  indicates  for  each  x  the  expected 
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Insert  Table  2  about  here 

amount  of  money  the  decision  maker  stands  to  lose  under  (see  Table  2).  The 
expected  values  are  defined  as 


EV(x|S1)  =  Pr(d<x|Si) 

(21) 

EV(x|fl2)  =  Pr(d>v|S2). 

(22) 

As  in  the  scoring  rule  example,  we  ha/e  in  this  paradigm  a  1:1  mapping 
from  prior  probabilities  w  into  the  choice  alternatives  x.  As  it  is  well 
known  P*,  the  optimal  likelihood  ratio  criterion  for  the  payoffs  given  is 

P*  =  w^l-w^  (25) 

and  consequently 

x*  “  In  Cw1/(l-w1)]/d'  (2k) 

where  x*  is  the  optimal  cutoff  value  under  w, ,  or  w  e  W  . 

1  lx* 

Insert  Figure  8  about  here 

Figure  8  shows  the  U*  function  as  determined  from  Table  2. 

U*(w1)  =  W]Pr(d  <  x*/Sx)  +  (l-Wl)Pr(d  >  x*/S2).  (25) 

On  the  abscissa  we  have  ordered  the  x*-values  under  w  to  show  how  they  are 
related. 

Assume  now  that  w  *  .5  is  the  true  prior  probability,  but  instead  of 


x*  =  0  we  choose  some  other  x'  =  +  oo  which  would  be  optimal  under  w  =  1. 
Figure  H  demonstrates  the  possible  loss  AU  we  expect  in  this  case. 

We  see  how  the  flatness  of  U*  prevents  this  loss  from  being  large,  v. 
Winterfeldt  and  Edwards  (1972)  showed  in  a  direct  analysis  of  the  U-function, 
that  U  is  generally  flat  in  signal  detection  situations. 


A  Multi-Attribute  Example 

Assume  that  we  have  two  attributes  on  which  we  evaluate  riskless  options, 

say  job  offers.  Attribute  may  be  salary,  may  be  staff  benefits.  We  have 

five  offers,  each  of  which  has  been  evaluated  by  a  utility  function  u  in  each 

i 

Insert  Table  3  about  here 


attribute  (see  Table  3).  We  can  immediately  delete  x  since  it  is  dominated: 

3 


u(z)  =  l  u^)  +  ±  u(x^)  =  (8,  10) 


(26) 


i.e 


•  y 


W  >  Ui(x51)  and  W  -  U2(X32^  (27) 

All  other  alternatives  are  admissible.  U*  in  this  case  will  be  piecewise 
linear,  and  its  construction  is  rather  easy.  We  just  plot  all  the  functions 

LKx^)  =  w1ul(xil)  +  (l"Wl)u2(Xi2}  (28) 

(see  figure  9),  Naturally  U*  is  defined  by  the  line  segments  of  U  such  that 

U^,Wi)  -  U^-j,VV  f°r  a11  (29) 
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(in  Figure  9  marked  by  the  solid  line).  Assume 


now  that  we  choose  for 

Insert  Figure  9  about  here 

W1  =  Figure  9  also  indicates  what  we  will  stand  to  lose. 

Similar  analyses  can  be  done  with  any  matrix  like  the  one  in  Table  3,  ns 
we  find  them  in  time  discounting  models  or  simple  decision  analysis  problems. 
For  more  than  two  states  graphical  representations  become  impossible,  and 
computer  aid  is  needed.  In  those  cases  one  should  use  the  approach  of  bounding 
losses  by  the  slopes  and  points  of  U*  as  sketched  in  the  previous  section. 
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Table  1 


Schematical  representation  of  the  scoring  rule 
situation  with  quadratic  scoring  functions 
(SEV(Xl)  -w^Ml-x^S]  +  (l-w1)(l-x12)). 


Table  3 


Multi -attributed  outcomes  described 
by  their  single  attribute  utilities 


Figure  lb 


Graphs  of  functions  which  do  not  satisfy  (a)  -  (d) 


Graphs  of  hypothetical  U* -functions  which  satisfy  the  boundary 
conditions  U*(0)=k  and  U*(l)  =  1  and  convexity. 


WEIGHT  w 


Graphs  of  non-admissible  U*  functions 


WEIGHT  w 


Figure  4 


Demonstration  of  the  relation  between  the  hyper- 
lniK..  U(£,w)  .ind  U'-‘(w)  in  ||„.  nooning  rule  example 


TRUE  SUBJECTIVE  PROBABILITY  OF  S 


Figure  5 


Inadmissible  U*  function  (with  foundary  values  k  and  1)  which 
would  make  the  loss  due  to  small  deviations  from  an  optimal 
choice  severe. 


Figure  7 


Graphical 

under 


determination  of  the  maximum  possible  loss 

a  minimax  choice  y  (AU  .  .  ) 
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figure  8 


U  and  U*  functioi^  in  the  signal  detection  example 


PRIOR  PROBABILITY  OF  S 


MULTI  ATTRIBUTE  UTILITY  U  FOR  EACH  CHOICE 
ALTERNATIVE  AND  WEIGHT  VECTOR  w.U  (x,. 


